In mathematics, the Hermite polynomials are a classical orthogonal polynomial sequence that arise in probability, such as the Edgeworth series; in combinatorics, as an example of an Appell sequence, obeying the umbral calculus; and in physics, where they give rise to the eigenstates of the quantum harmonic oscillator. They are named in honor of Charles Hermite.
Contents |
(the "probabilists' Hermite polynomials"), or sometimes by
(the "physicists' Hermite polynomials"). These two definitions are not exactly equivalent; either is a rescaling of the other, to wit
These are Hermite polynomial sequences of different variances; see the material on variances below.
Below, we usually follow the first convention. That convention is often preferred by probabilists because
is the probability density function for the normal distribution with expected value 0 and standard deviation 1.
The first eleven probabilists' Hermite polynomials are:
and the first eleven physicists' Hermite polynomials are:
Hn is a polynomial of degree n. The probabilists' version has leading coefficient 1, while the physicists' version has leading coefficient 2n.
Hn(x) is an nth-degree polynomial for n = 0, 1, 2, 3, .... These polynomials are orthogonal with respect to the weight function (measure)
or
i.e., we have
when m ≠ n. Furthermore,
or
The probabilist polynomials are thus orthogonal with respect to the standard normal probability density function.
The Hermite polynomials (probabilist or physicist) form an orthogonal basis of the Hilbert space of functions satisfying
in which the inner product is given by the integral including the Gaussian weight function w(x) defined in the preceding section,
An orthogonal basis for L2(R, w(x) dx) is a complete orthogonal system. For an orthogonal system, completeness is equivalent to the fact that the 0 function is the only function ƒ ∈ L2(R, w(x) dx) orthogonal to all functions in the system. Since the linear span of Hermite polynomials is the space of all polynomials, one has to show (in physicist case) that if ƒ satisfies
for every n ≥ 0, then ƒ = 0. One possible way to do it is to see that the entire function
vanishes identically. The fact that F(it) = 0 for every t real means that the Fourier transform of ƒ(x) exp(−x2) is 0, hence ƒ is 0 almost everywhere. Variants of the above completeness proof apply to other weights with exponential decay. In the Hermite case, it is also possible to prove an explicit identity that implies completeness (see "Completeness relation" below).
An equivalent formulation of the fact that Hermite polynomials are an orthogonal basis for L2(R, w(x) dx) consists in introducing Hermite functions (see below), and in saying that the Hermite functions are an orthonormal basis for L2(R).
The probabilists' Hermite polynomials are solutions of the differential equation
where λ is a constant, with the boundary conditions that u should be polynomially bounded at infinity. With these boundary conditions, the equation has solutions only if λ is a non-negative integer, and up to an overall scaling, the solution is uniquely given by u(x) = Hλ(x). Rewriting the differential equation as an eigenvalue problem
solutions are the eigenfunctions of the differential operator L. This eigenvalue problem is called the Hermite equation, although the term is also used for the closely related equation
whose solutions are the physicists' Hermite polynomials.
With more general boundary conditions, the Hermite polynomials can be generalized to obtain more general analytic functions Hλ(z) for λ a complex index. An explicit formula can be given in terms of a contour integral (Courant & Hilbert 1953).
The sequence of Hermite polynomials also satisfies the recursion
The Hermite polynomials constitute an Appell sequence, i.e., they are a polynomial sequence satisfying the identity
or equivalently,
(the equivalence of these last two identities may not be obvious, but its proof is a routine exercise).
It follows that the Hermite polynomials also satisfy the recurrence relation
These last relations, together with the initial polynomials H0(x) and H1(x), can be used in practice to compute the polynomials quickly.
The physicists' Hermite polynomials can be written explicitly as
for even values of n and
for odd values of n. These two equations may be combined into one using the floor function:
The probabilists' Hermite polynomials have similar formulas, which may be obtained from these by replacing the power of 2x with the corresponding power of (√2)x, and multiplying the entire sum by 2-n/2.
The Hermite polynomials are given by the exponential generating function
This equality is valid for all x, t complex, and can be obtained by writing the Taylor expansion at x of the entire function z → exp(−z2) (in physicist's case). One can also derive the (physicist's) generating function by using Cauchy's Integral Formula to write the Hermite polynomials as
Using this in the sum , one can evaluate the remaining integral using the calculus of residues and arrive at the desired generating function.
If X is a random variable with a normal distribution with standard deviation 1 and expected value μ then
The Hermite polynomials can be expressed as a special case of the Laguerre polynomials.
The Hermite polynomials can be expressed as a special case of the parabolic cylinder functions.
where is Whittaker's confluent hypergeometric function. Similarly,
where is Kummer's confluent hypergeometric function.
The probabilists' Hermite polynomials satisfy the identity
where D represents differentiation with respect to x, and the exponential is interpreted by expanding it as a power series. There are no delicate questions of convergence of this series when it operates on polynomials, since all but finitely many terms vanish.
Since the power series coefficients of the exponential are well known, and higher order derivatives of the monomial xn can be written down explicitly, this differential operator representation gives rise to a concrete formula for the coefficients of Hn that can be used to quickly compute these polynomials.
Since the formal expression for the Weierstrass transform W is eD2, we see that the Weierstrass transform of (√2)nHn(x/√2) is xn. Essentially the Weierstrass transform thus turns a series of Hermite polynomials into a corresponding Maclaurin series.
The existence of some formal power series g(D), with nonzero constant coefficient, such that Hn(x) = g(D)xn, is another equivalent to the statement that these polynomials form an Appell sequence. Since they are an Appell sequence they are a fortiori a Sheffer sequence.
The Hermite polynomials have a representation in terms of a contour integral, as
with the contour encircling the origin.
The (probabilists') Hermite polynomials defined above are orthogonal with respect to the standard normal probability distribution, whose density function is
which has expected value 0 and variance 1. One may speak of Hermite polynomials
of variance α, where α is any positive number. These are orthogonal with respect to the normal probability distribution whose density function is
They are given by
In particular, the physicists' Hermite polynomials are
If
then the polynomial sequence whose nth term is
is the umbral composition of the two polynomial sequences, and it can be shown to satisfy the identities
and
The last identity is expressed by saying that this parameterized family of polynomial sequences is a cross-sequence.
Since polynomial sequences form a group under the operation of umbral composition, one may denote by
the sequence that is inverse to the one similarly denoted but without the minus sign, and thus speak of Hermite polynomials of negative variance. For α > 0, the coefficients of Hn[−α](x) are just the absolute values of the corresponding coefficients of Hn[α](x).
These arise as moments of normal probability distributions: The nth moment of the normal distribution with expected value μ and variance σ2 is
where X is a random variable with the specified normal distribution. A special case of the cross-sequence identity then says that
One can define the Hermite functions from the physicists' polynomials:
Since these functions contain the square root of the weight function, and have been scaled appropriately, they are orthonormal:
and form an orthonormal basis of L2(R). This fact is equivalent to the corresponding statement for Hermite polynomials (see above).
The Hermite functions are closely related to the Whittaker function (Whittaker and Watson, 1962) :
and thereby to other parabolic cylinder functions. The Hermite functions satisfy the differential equation:
This equation is equivalent to the Schrödinger equation for a harmonic oscillator in quantum mechanics, so these functions are the eigenfunctions.
Following recursion relations of Hermite polynomials, the Hermite functions obey
The Hermite functions satisfy the following bound due to Harald Cramér[1][2]
for x real, where the constant K is less than 1.086435.
The Hermite functions are a set of eigenfunctions of the continuous Fourier transform. To see this, take the physicist's version of the generating function and multiply by exp(−x 2/2). This gives
Choosing the unitary representation of the Fourier transform, the Fourier transform of the left hand side is given by
The Fourier transform of the right hand side is given by
Equating like powers of t in the transformed versions of the left- and right-hand sides gives
The Hermite functions are therefore an orthonormal basis of L2(R) which diagonalizes the Fourier transform operator. In this case, we chose the unitary version of the Fourier transform, so the eigenvalues are (−i) n.
In the Hermite polynomial Hn(x) of variance 1, the absolute value of the coefficient of xk is the number of (unordered) partitions of an n-member set into k singletons and (n − k)/2 (unordered) pairs.
The following identity holds in the sense of distributions[3]
where δ is the Dirac delta function, (ψn) the Hermite functions, and δ(x − y) represents the Lebesgue measure on the line y = x in R2, normalized so that its projection on the horizontal axis is the usual Lebesgue measure. This distributional identity follows by letting u → 1 in Mehler's formula, valid when −1 < u < 1:
The function (x, y) → E(x, y; u) is the density for a Gaussian measure on R2 which is, when u is close to 1, very concentrated around the line y = x, and very spread out on that line. It follows that
when ƒ, g are continuous and compactly supported. This yields that ƒ can be expressed from the Hermite functions, as sum of a series of vectors in L2(R), namely
In order to prove the equality above for E(x, y; u), the Fourier transform of Gaussian functions will be used several times,
The Hermite polynomial is then represented as
With this representation for Hn(x) and Hn(y), one sees that
and this implies the desired result, using again the Fourier transform of Gaussian kernels after performing the substitution